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Abstract 

We consider the goodness of fit testing problem for ergodic diffu- 
sion processes. The basic hypothesis is supposed to be simple. The 
diffusion coefficient is known and the alternatives are described by the 
different trend coefficients. We study the asymptotic distribution of 
the Cramer-von Mises type tests based on the empirical distribution 
function and local time estimator of the invariant density. At partic- 
ularly, we propose a transformation which makes these tests asymp- 
totically distribution free. We discuss the modifications of this test in 
the case of composite basic hypothesis. 
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1 Introduction 

The goodness of fit (GoF) tests play a special role in statistics because they 
form a bridge between mathematical models and the real data. In classical 
situation of i.i.d. observations X n = . . . ,X n } and the basic hypothesis 
: the distribution function of Xj is F (x), the traditional solution is to 

construct a test statistics A n = D (P n , Foj based on some distance between 

the empirical distribution function F n (x) and the given (known) function 
F (x). Then the test function is defined by ^ n = l{A n >c e }, where the con- 
stant c e is chosen from the condition: lim^oo P {A„ > c e } = e, e £ (0, 1). 
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The diversity of tests comes from the diversity of distances. At particularly, 
if we take 



n 



H(x) 



x 



Fn(x) 



-i 2 



dF n fx) 



(1) 



then we obtain the well-known Cramer- von Mises family of statistics [16] . If 
H (x) = 1, then we have Cramer- von Mises test and if the weight function 
H (x) = (F (x) [1 — F (x)]) _1 we obtain the Anderson-Darling test. 
In the case of uniform metric 



A n = sup y/n 



F n (x) - F (x) 



(2) 



and ty n = 1{a„>c.} we have Kolmogorov-Smirnov test. Remind that the tests 
based on these statistics are asymptotically distribution free (ADF) and for 
continuous F (x) under hypothesis Mo we have the convergence 



n 



n 



F n (x) - F n (x) 



F n (x) - F (x) 



F (x) [1 - F (x) 



dF n (x) 



dF (x) 



W (s) 2 ds 
1 W (s" 2 



8(1 



ds, 



sup y/n 



x 



FJx) 



sup \Wq (s) 

0<s<l 



where W (x) is Brownian bridge. These last property of the statistics allows 
us to chose once the constant c E for all F (x). Note as well that the both 
statistics tend to oo for any fixed alternative F (•) ^ F (•) and this property 
provides the consistency of these tests. 

The present work is devoted to the similar problem but in the case of 
continuous time observations X T = {X t , < t < T} of ergodic diffusion pro- 
cess 

dX t = S (X t ) dt + a (X t ) dW t , X , < t < T. (3) 

The diffusion coefficient a (-) 2 is supposed to be known and the hypothesis is 
concern the trend coefficient S (■) only. That means, that the basic hypothesis 
is simple: 

: The observed trajectory X T is solution of the stochastic differential 
equation 

dX t = So (X t ) dt + a (X t ) dW t , Xo, < t < T 

where So (x) is some known function. 

The alternative corresponds to the case S (■) ^ So (■)• 
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We suppose that the trend (under hypothesis and alternative) and diffu- 
sion coefficients satisfy the conditions: 

£S. The function S (•) is locally bounded, the function a (-) 2 is continuous 
and positive and for some A > the inequality 

xS(x) + a(x) 2 < A (l + x 2 ) 

holds. 

This condition provides the existence of unique weak solution of this 
equation (see [3J, P- 210). 

Moreover, we suppose that the following condition is fulfilled too. 
TZV. The function 

V(S,x)= [ exp(-2 F-^^dz] dy — ► ±oo 



as x ^ ±oo and 



o I Jo o-(z) 2 



G(S)= I o (y)-' 2 exp <j 2 F ^4 dz \ d V < 00 



poo f py 

I a(y)- 2 expl2j 



a(zY 




By this condition the observed process is recurrent positive and has er- 
godic properties with the density of invariant law 

f (%) — o exp 1 2 , 

G(S)a(x) 2 y \ Jo a(y) 

The corresponding density and distribution function under hypothesis 
we denote as fo (x) and Fq (x) and the mathematical expectation as Eo. 
Moreover, we suppose that the initial value Xq is a random variable with 
this distribution function because this condition simplifies exposition (the 
observed process is stationary). 

Let us fix some e G (0, 1) and denote by JC £ the class of tests ipr of 
asymptotic size e, i.e.; EoV't = e + o (1). We are interested by the GoF tests 
of asymptotic size e, which are ADF. 

The problem of goodness of fit testing can be considered as follows: let 
us introduce some statistic 5t such that its limit distribution G (x) under 
hypothesis does not depend on the model, then the test ipT = l{<5 T >c e }, where 
the constant c £ is solution of the equation 1—G (c e ) = e is ADF. Moreover, we 
require as well that for any fixed alternative (defined by the trend coefficient 
S (■)) we have P {5t > c £ } — > 1, i.e.; the test is consistent. 
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Let us remind here some of known tests satisfying these conditions. The 



first ones ipr {X T ) = 
following two statistics 

1 



l {S T >de} 



and <p T (X T ) = 

l{7 T >c e } are based on the 



t r 



7t 



T 2 E [a(0 2 ] 
1 

sup 



Xt — Xq — 

Xt — x 



S (X v ) dv 



dt. 



S (X V ) dv 



Here and in the sequel £ is the random variable with the density fo 
It is shown that under hypothesis Mo 
•i 



w (v) dv, 7t 



o 



sup \W{V) 
0<v<l 



Hence the constants c £ and d £ are defined by the equations 

\2 



w (v) dv > d £ 



o 



sup \w [v)\ > c £ 

0<v<l 



e. 



(4) 



. These tests belong to /C e and are consistent against any alternative S (•) ^ 
S (•) such that EoS* (£) ^ 0. Moreover, it is shown that the test ipx {X T ) is 
asymptotically optimal in special sense (see [2] for details). 

Another GoF test ifT = l|A T >e } was P ro P ose d by Negri and Nishiyama 
1191. It is based on the statistic 



TE a (0 J 



sup 



L {X t < 



x} [dX t -S (X t )dt] 



(5) 



which converges to A = sup 0< , )<1 |iy(f)|. The constant c £ is defined in 
(Tj0). This test belongs to K £ and is consistent against any fixed alternative 
satisfying condition: for some x we have E (l {£<£•} [S (£) — S (£)]) ^ 0. 

Note that the similar question of the goodness of fit testing for ergodic 
diffusion processes by discrete time observations was extensively studied (see 
Chen and Gao pQ and references therein). 

The goal of this work is to study the tests which are direct analogues of 
the classical GoF tests like Anderson-Darling ([1]) and Kolmogorov-Smirnov 

To test the hypothesis Mo we propose two tests of Cramer-von Mises 
type. The first one is based on empirical distribution function (EDF) 

1 f T 

f t{x) = - / l {Xt <x}dt 
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and the statistic is similar to ([T]): 



T 



H(x) 



F T (x)-F (x) dF (x). 



The second test is based on local time estimator (LTE) fx (x) of the invariant 
density, which can be written as 



fr(x) 



At (x) 
T a (x) 2 



\Xt — x\ — \Xq — x\ 
T a (x) 2 



1 



T a (x) 



sgn (X t - x) dX t 



Here At (x) is the local time of the diffusion process. The corresponding 
statistic is 



T 



h (x) f T (x) - f (x) 



dF n (x) 



We discuss as well the Kolmogorov-Smirnov type test with the test statistic 



(6) 



7r = sup VTg (x) f T (x) - f (x) 



The goodness of fit tests are = l{A T > Ce }, and tpr = ^{8 T >d £ } and ipT = 
l{ 7T>ee }. These tests with H (x) = l,h(x) = l,g(x) = 1, were proposed in 
[2], but unfortunately they are not ADF (see as well [6], [7j, where similar 
test statistics are discussed). 

The test <^r, © is ADF. It is interesting to see the relation between the 
statistics (jSJ) and (EJ). Suppose that a (x) = 1 and remind that 



It(x) = - 



T 



1 {X t <x} S {X t ) dt 



are unbiased estimators of the invariant density |13j . The first is asymptoti- 
cally equivalent to the local time estimator fa (x), i.e.; 



T [f T (x)-f T (x))=o(l) 

and for the second we have 

f* (x) — . 2E l {?<x} So (0 = f (x) . 

Of course, the second estimator is not indeed estimator of the invariant den- 
sity because it uses the function So (x), but this choice of the second term 
makes the statistic asymptotically distribution free. Therefore 



A 



r 



sup\f T (x) - fr(x)\ . 



The random processes {t]t (x) , x G 7Z} and {(t (x) ,x & 7Z} where 



Vt (x) 



F t (x) - F n (x) 



and Ct (#) 



/t (a;) - fo {x) 



converge to the Gaussian processes {rj (x) , x G 7^} and {( (x) , x G 7£} with 
quite complicate structure (for the first convergence see [IB] and for the 
second see [H]). For example, 



EoC(z)C(j/)=4/o(x)/ (j/) 



(!{,>,} -Fo^)) (l {z > y} -F (z)) 
<r(*) 2 /o(z) 



d2 



The goal of this work is to show that for certain choice of weight functions 
H (x) and h (x) the tests = l{A T >c e } and ifc = l{s T >d £ } can be ADF and 
consistent against some alternatives. 

We discuss as well some other ADF GoF tests. 



2 GoF tests based on LTE 

Let us denote by fi the median of the invariant law : F (/x) = 1/2. We 
consider a special hypotheses testing problem, when the trend coefficient is 
fixed for x < fi or for x > fi. Therefore, we consider one sided alternatives 
only. Suppose that the changes in the trend coefficient are possible for x > 
[A, i.e., the values of S (x) for x < fi are the same under hypothesis and 
alternative. 

We study first the Cramer- von Mises type test ipr = ^-{s T >d £ }, where the 
statistic 

5 T = T ! h(x) (/ r (x) - f (*)) 2 dF (x) 



with 



and 



h (x) = 2F ° ^ - 1 e -*(*)/*M 1 , > , 

o-(x) f (x) 



i 2 



(l {y>x} - F (y) Y 

°(y) 2 fo (y) 

The study of this test is based on the asymptotic normality of the LTE: 



T (/ T (*) - / (*)) C (a;) ~ N (0, (a;) 2 ) , 

where df(x) 2 = 4fn (x) 2 $ (x). But we have two other classes of density 
estimators, which are consistent and asymptotically normal with the same 
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limit distribution. These are kernel type estimators 

r-T 



f T (x) = -L^ K(ff{X t -x)) dt, 



and the unbiased estimators 



Hx) = - 



T 



{x t < x} KX t ) 1 

v rj~\ 



T J a(x) 2 h(x) 



T \ {Xt < x} h'(X t ) a(X t ) 2 
<j(x) 2 h(x) 



dt 



where h (•) G C is an arbitrary function. Under mild regularity conditions 
(see (14], Propositions 1.58 and 1.61) we have the same asymptotic normality 



and 



T {Jt (x) - / (x)) =► N (0, dj (xf) 



Af(0,d f (x) 2 ) . 



T[f T (x)-f(x] 



Therefore, the results obtained for the test ipT based on the LTE can be valid 
for the tests based on the kernel-type or unbiased estimators as well (under 
strengthen conditions) . 

The calculation of the statistic St can be slightly simplified. Remind the 
equalities 



T 



9 (X t ) dt 



g (x) f T {x) dx 



g (x) dF T (x) . 



Hence 



6 T = T h(x)(f T (x)-f Q (x)) dF T (x 



+ T 

r-T 



h(x)[f T (x)-fa(x)) d F Q (x)-F T (x) 

1 



h(X t )[f T {X t )-f (X t )\ dt- 
h(X t ) \f T (X t )-f (X t )] 2 dt + o(l) 



h (x) (t (x) dx 



and we can use the statistic 

r-T 



h(X t ) f T (X t )-f {X t 



dt 



(7) 
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in the construction of our GoF test. To avoid the difficulties with calculation 
of stochastic integral related to fx (X t ) we can use here the kernel type 
estimator of the density. 

Note that the LTE of the density is the derivative with probability 1 of 
the EDF. Indeed, we have the equality (r > 0) 

F T (x + r)-F T (x) 1 f T 1 f x+r f 

= -7F / Mx<x t <x+r} dt = - f T (y) dy 

r rT Jo ~ ~ r J x 

and, as the local time is continuous with probability one (see [20]), we have 

lim - / f T (y) dy = f T (x) . 



Let us define the constant d e as solution of the equation 

w(v) 2 e~ v dv > d e 



where w (•) is a Wiener process and introduce the following condition. 
Condition A. The function $ (x) < oo for all x G (/i, oo) and 

(2F (x)-l)$(x) *w 



A 1 = / k k k e dx < oo, 



<S>(x) 



A tW'»W'AW U> ff(«) 2 /o('f) / 

Proposition 1 Let t/ie conditions £S, TZV and A be fulfilled, then the GoF 
test tp T = l{6 T >d E } is ADF. 

Proof. We have to prove the convergence (under hypothesis J#o) 

w (v) 2 e~ v dv. 

First we remind some properties of the local time estimator of the density 
(under hypothesis J£q). The following presentation is valid (see [H], p. 29) 



8 



in obvious notation. We supposed that the process X t ,t > is stationary, 
hence 

'J °\ VT Jx a(vfh(v) 

~ T Vio o (vf /„ (v) 
Further, by condition (J9j) we have 

4 r h ( x) /o (x) 2 E ( f 1{ 7\" F ; ( ; } 2 ^ 

r(2F,(*)-l)e-g3 E| (fV)-F,W di<oc 

Hence 

A 2 



J°°h(x) E (4 2) ) 2 dF (a;) < 



T 

Therefore it is sufficient to study the first integral in (JTU1) . i.e., 
Let us denote 

1 /" T l{X t >a:} - -Ft) PQ) 



VT Jo ^ C^tJ Jo (A t ) 



By the law of large numbers we have the convergence 

hence we can apply the central limit theorem (see, e.g., [14J, Theorem 1.19) 
and to obtain the convergence of all finite dimensional distributions of Cr (•) 
to the multidimensional Gaussian law: 

Ct (ari) , . . • , Ct (a*)) (C (»i) , • • ■ , C (a*) J , (11) 

where the Gaussian process £ (x) is with zero mean and it's covariance func- 
tion is 

*<*,,) = r (l{ -" F ° ( i (l| " ri " F ° ( " )) 

J-oo o-(u) /o(v) 



The process £ (x) admits the representation 

cm = f°° ^-^Lh w,. 

J-oc o (y) V/o \y) 

The last integral is with respect to double-sided Wiener process, i.e.; W y = 
W + (y) , y > and W y = W~ (— y) , y < 0, where W 7- (•) and (•) are two 
independent Wiener processes. 

The convergence fill I) formally can be obtained as follows (egalities in 
distribution) 



oo \ 

oo 



■/-oo CT (y) V /O (y) 

where W (s) ,s > is some Wiener process. We used above the following 
property of local time of ergodic diffusion process : for any integrable function 

<?(•) 

1 ' T 



- / g{X t )dt = g (y) / T (y) dy — / <? (y) / (y) dy. (12) 

1 JO J-oo J-00 

To verify the convergence 

POO 

h (x) f {xf C (x) 2 dF (x) = 4 / h (x) / (a;) 3 W ($ (x)) 2 dx 
we note that for any e > there exists L > fi such that 

/•oc 2 

y /i^Eo^W) dF (x)< £ . 
Therefore it is sufficient to show that for any L > // 

L (x) / {xf ( T {xf dF (x) =► / h {x) f {xf C (x) 2 dF (x) . 

J n 
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This last convergence follows from the the convergence of finite dimensional 
distributions ffTTj) and the estimate 



E 



Cr O2) 2 - Cr {x\Y 



< C L \x 2 - xi 



1 1/2 



(13) 



which is valid for all \xi\ < L (see [9], Theorem 9.7.1). The estimate ( TT31) can 
be obtained as follows. We have 



Er 



Ct (x 2 ) 2 - Ct (xi) 2 



Ct (x 2 ) + Ct (xi) 
2 



< E Cr {x 2 ) - Cr (xi) E c 

< E |Ct (x 2 ) - Cr (xi))' (2EoCt (x 2 y + 2E Ct (xi) 

< (2$ (x 2 ) + 2$ ( Xl )) E |Ct (x 2 ) - Ct (xif 
Further (let x 2 > x\) 



E 



Ct (x 2 ) - Cr {x\ 



Er 



1 



1 



{xi<X t <X2} 



^fj a(X t )f (X t ) 
2 — \x 2 — X\ \ . 

xi V(V) fo(v) 



Remind that h (■) is continuous on L]. 

The last step is to verify the equality (in distribution) 



h(x)f (x) 2 C(x) 2 dF (x) 



w (v) 2 e v dv. 



For the function 
$ (x) = 



(1 



{y>x} 



Fo(y)Y 



&(y) fo(y) 



dy 



Fo (yY 



°{y) fo(v) 



dy + 



(1 - F {y)f 

o {yf h {y) 



dy 



we have 



$' (x) 



2F (x) - 1 
a (x) 2 f (x) 



< 0, for x < fi 



and $' (x) > for x > fi. Hence the functions $ (x) , x < fi and $ (x) , x > \i 
are strictly monotone (decreasing and increasing respectively). Moreover, 
$ (±00) = 00. 
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We can write 

h(x) f (x) 3 W ($(x)) 2 dx 

2 j- / \4 



4 r h (x)a(x) f (x) 2 



e -zMM) dz= r w ( v )2 e -v dv _ 



To show the consistency of this test against any fixed alternative 

M{ : S(x)± S (x) 

we just note that 

VT (J t (x) - f (x)) = VT (/t (x) - / (x)) + VT (f (x) - / (x)) 

where the first term is asymptotically normal and for the second term we 
have 

/■oo 

T h{x) (f (x) - f (x)) 2 dF (x) — oo. 

J p 

Here / (x) is the invariant density function (under alternative). Of course, we 
have to suppose that the corresponding integrals like §B§ and © are finite. 

We now apply the similar arguments to study the Kolmogorov-Smirnov 
type statistic 

7t = sup Vfg (x) f T (x) - f (x) . 

Our goal is to chose such weight function g (•) > that the GoF test tpT = 
l{ 7T> e e } based on this statistic be ADF. Let us put a (x) = 1 (for simplicity). 
Remind that the weak convergence 

Ct(0=K(0 

in the space of continuous functions vanishing at infinity was already proved 
in [Hj, Theorem 4.13. This convergence provides as well the convergence of 
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our statistic 77- with g(x) = 1. Suppose that the function g (•) is such that 
we have the convergence 7t =^ 7o = su Pa; g{x) |C (^)l too. Let us put 



g(x) 



-*(*)/*(/*) 



2/0 (x) v 7 ^) 

Then we can write (egalities in distribution) 

Uy>x} - F o (y) 

> o- (y) v 7 /o (y) 



70 = sup (x) /o (x) 



= su P 2( 7 ( a ;)/o(a:) |W($(a;))| 

= sup J — -^==^- e = sup | w e 

X>/J, y/^f (/i) t)>l 

We see that the test tpr = l{7 T >e £ } with e £ from the equation 

P < sup \w (v) I e~ v > e £ > = e 
U>i J 

is ADF and belongs to /C £ . 

Remark. Note that these arguments do not work directly in the case of 
double sided alternatives, i.e., if the function S (x) changes under alternative 
for the values x < ji too. It can be shown that the limit 



/oo 
[w 1 (v) 2 + w 2 (v) 2 ] e~ v dv 



holds, but the Wiener processes u>i (•) and W2 (•) are correlated and the cor- 
relation function depends on the model. 



3 GoF test based on EDF 

We study the GoF test ^ T = l{A T > Ce } with Cramer-von Mises type statistic 

A T = T I H(x) (P T (x) - F (x)) * dF (x) 

and our goal is to chose such weights H (•) > that this statistic converges 
to the distribution free limit: 



/oo 
w(v) 2 e~ v 



dv. 
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This statistic can be written in the empirical form like (|7|) 
A* T = £ H (X t ) (P T (X t ) - F (X t )) 2 dt 

because Ay = A^ + o (1) with the same explication as above. 

The properties of this statistic are quite close to that of St, that is why 
we do not give here all details. 

We suppose that the conditions (oS, RP and $ (x) < oo are fulfilled and 

y, * w /o w e ° { — owm5 — ) dx < 00 <i4) 

The process r] r (x) = -\/T (^F T (x) — F (x)J admits the presentations (see 
p. 85) 



VT Jo CT \ x t) Jo \X t ) 



+ 2 ^F,(,A I )-F, W F 1 ( I ) Ji 
VTJxo <t(v) fo(v) 

Note that if $ (x) < 00, then (see (HJ, Remark 1.64) 

, , x2 _ / F (QFo(x)-Fo(eAa:) V ^ 

Mx) = 4E °l ^M) J <0 ° 

too and 

VT (P t (x) - F Q (x)\ A/" (0, d F (x) 2 ) . 
It can be shown that (under mild regularity conditions) 

POO 

A T A = / H (x) f (x) fj [xf dx, 
J n 

where 

fj (x) = [F (x) - 1] f F ° dW y + F (x) /°° dW y . 

J-00 a (y) x //o (y) A cr (y) J f a (y) 



Hence 



POO 

A = / if (x) /o (x) [F (x) - l] 2 W (* (x)) 2 dx, 
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where 



•/-«> a (y) /o (y) A V-Po (x) - I J 



v(yffo(y) A \F (x)-lJ a(y) 2 f (y)' 



Further 



Ao = J f gWAWgW-i]V (»(,))'d» W 

/■oo 

= (0) -2 / W (* (x)) 2 e"*^/*W (x) 
u> (f ) 2 e _-u df , 

where we put \I/ (x) = u 1 ^ (//), w (v) = \I/ (/i) W (v*f> (//)) and 

# (V) = — ^ -*(a)/*(0) 

* (/i) 2 /o (*) [F (s) - l] 2 

Of course, we suppose that the function (x) , x > fi is strictly monotone 
and \& (oo) = oo. 



4 Examples 

Let us consider two examples. The first one is 

Example 1. Ornstein-Uhlenbeck process. Suppose that the observed 
process under the basic hypothesis is 

dX t = -a (X t - b) dt + adW t , X , < t < T, 

where a > 0. The invariant density is Gaussian / Q (x) ~ Af (^b, ^\ with 

median fi = b. To check the conditions (jHJ) and @ we estimate first the 
asymptotics of (x) as x — > oo. We have 

4W= r (iw-fofa)) 2 r _j_ (1 + 0(1)) 



- l + o 1 

X 



Hence the both conditions are fulfilled and the test ipr = l{<5 T >d £ } with 
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is ADF. 

It is easy to see that the conditions (TT3|) and ffTo"]) are fulfilled too and the 

test ^ T = l{A T >d e } with 

At = T r — ^ e -*(*)/*(o) ( ^ T f x ) _ p. ( X )Y dx 

T J b W(u) 2 \F (x)-l} 2 \ T[) 0{) ) 



is ADF. 



Example 2. Simple switching process. Suppose that the observed 
process under hypothesis is 

dX t = -asgn{X t - b) dt + a dW t , X , 0<t<T 

where a > 0. The process is ergodic with invariant density 

fa ( x ) = -5 ex P I \ x ~ b \ 



9 ~*-r 1 o 



and median fi = b. The function 

* (x) = f — dy (1 + o (1)) = a- 1 e** (1 + o (1)) 

i-00 ^ Jo (y) 

as x — > 00 and the conditions (jEJ), © and ( IT31) . ( fl5l) are fulfilled. The direct 
calculation shows that ^ (x) is strictly monotone function. Therefore the 
corresponding tests ipT and ^>t are ADF. 

The limit distribution of the test statistic 8? with h (x) = 1 were studied 
by Gassem [8], who obtained the Karhunen-Loeve expansion for the limit 
Gaussian process C (')■ 

5 Composite hypotheses 

Suppose that the observed diffusion process (under hypothesis J£q) is 

dX t = S(ti,X t ) dt + a(X t ) dW t , X , < t < T 

where d is unknown parameter d G (a, b). Therefore the basic hypothesis is 
composite. The test statistic can be 

5 T = T J™ h (ti T , x) (f T (x) - f o T , xfj 2 dF (d T , x), 
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where $t is some consistent and asymptotically normal estimator of d and 
h (•&, x) is the same function as before with obvious modification, say, f (x) = 
fo ($, x) . Unfortunately the test based on this statistic is no more ADF 
because its limit distribution depends on the distribution of estimator. To 
compensate this contribution of estimator we can modify this statistic as 
follows (see, e.g., Koul [12] for similar transformation in time series). Suppose 
that $t is the MLE and the corresponding regularity conditions are fulfilled 
(see 1141, Theorem 2.8). Then we have 



vfU T -A = m- 1 T- i/2 j ^-^W+o(i), 

v J Jo a ( x t) 

where dot means derivation w.r.t. 

We want to substitute here the MLE, but in this case the stochastic 
integral is not well defined, that is why we first rewrite this integral in the 
following form (Ito formula) 

Rt (*) = f S -^_ dWs = f ^Mil [dXs _ S (tf, X.) ds] 
Jo v{*s) J a(X s ) 

Xt S(0,y) dy j T S' (g, X 8 ) a [Xs) - 2S (g, X s ) a' (X x ) ^ 



r 



a{yf Jo 2a (X 8 

S($,X S ) S(#,X a ) 



ds. 



o [X s f 

Here prim means derivation w.r.t. x. The last expression contains no stochas- 
tic integral and we use it as definition of Rt ("&), where we can put $t- Now 
we can introduce the test statistic 



6 T = T j h (tf T , z) (/ T (x) - / O (0 T , x) 

+fo(#T,x) 10 T ) T- 1 R t ($t)) 2 dF (S T ,x). 



Note that 

It (x) - fo$ T , x) = It (x) - f (&, x) + f (■&, x) - f $ T , x) 
= It (x) - /„(#, x) - I (# T , x)$ T - 0) (1 + o (1)) . 

Hence 

S T = T [ h(0,x) (f T (x)-f (#,x)) 2 dF (#,x) + o(l) 



oo 



w (v) 2 e dv 
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and the test ip T — 1 



{s T >d e } 



is ADF. 



We supposed here that the median \x does not depend on •& (as in Example 
1 with a = °d and fi = b). 

In the case of Example 2 the situation is different. Suppose that d is the 
shift parameter: 

dX t = -a sgn (X t - ti) dt + a dW t , X , < t < T. 

Then we can use the statistic 

5 T = T I hU T ,x) (/ r (x)-/o(^T,x)) 2 dF (^ r ,x), 

•/ 17 f 

and it can be shown that 



w (f) 2 e v dv . 



Indeed, the MLE converges to d with the rate T (and not \/T) (see 
[H], Theorem 3.26) and its contribution to the limit distribution of 5t is 
negligible. 

Let us see what happens under alternative 



S(-) = SJ-) 



S* (■) e 



where the set 





: inf 







S (■&,-) -5 1 



a 



> 



and we suppose that the function 5* (■) satisfies the conditions £S and VSP . 
Therefore the invariant density is f$ t (■)• Here the norm 



\h(-) 



h (x) 2 fs, (x) dx. 



Note that the MLE in this "misspecified situation" converges to the value 
which minimizes the Kullback-Leibner distance 



S(-) 



a 



inf 



S(-) 



a 



(17) 



(see [Hj, Section 2.6.1 for details). 
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Hence, under regularity conditions we have 

f T (x)-faU T ,x) -^/ S ,(x)-/ (^,x), I(i? r ) 



and /o fi?T,^l — ► /o ($*,x)- Further, it can be shown that 



Rt \#t) i rT S (# T , X t j )S* (X t ) - S (& T , X, 
T = fj 7(XJ 2 

1 [ T S(#*,X t )[S*(X t )-S(A,X t )} 



S x) [5* (a;) — 5 (■#*, x)] 
cr (x) 2 



dt(l + o(l)) 



dt(l + o(l)) 



/ s . (x) dx = i?(tf*)- 



Therefore, 



/i (0„ x) / s , (x) - /o 0?*, x) + /„ (0., x) I (0„) R (0, 



dx 



and this test can be non consistent against alternatives 5* (•) such that 
/s. (x) - /o (#., x) + /o (#., x) F 1 (tf*) J2 (tf*) = 0. 

Suppose that is an interior point of 6 and show that the last equality is 
impossible. The value defined by the equation ( fT7|) is the same time one 
of the solutions of the equation 



S x) [S* (x) - S x)] 
ct(x) 2 



fs, (x) dx = 0. 



Hence = 0. The equality f s „ (x) = f (/#*,x) implies 



S* (y) 



dy 



o o{y) 



which gives us S* (x) = S ($*,x) for almost all x and the last equality con- 
tradicts the definition of alternative. Therefore, 5t — > oo and the test ipT is 
consistent. 
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6 Discussion 



Note, that the similar problems of ADF GoF tests for stochastic differential 
equations with "small noise" are considered in |15j . 

The tests ipx = l{<5 T >d e } an d = l{A T >c e } studied in this work are 
consistent against any fixed alternative and it can be easily shown, that 
these tests are uniformly consistent if the alternatives are separated from 
hypothesis as follows 

J% : S(.)EF r = {S(.):\\fs(-)-f (-)\\>r} 
with some r > for ipT or 

*X - S(-)eF q = {S(-):\\F s (.)-F (.)\\>q} 
for with some q > 0. Here the norm 

/oo 
m (x) 2 dF (x) . 
-oo 

This means that 

inf P s {5 T >d e }— >1, inf P s {A r >c B }— ^1. 

But if the alternative is defined by the Kullback-Leibner distance (s > 0) 

then the both tests are no more uniformly consistent. For example, the 
functions 

S n (%) = So i x ) + ao ~ i. x ) 2 cos i nx ) j n = 1, 2, . . . 
can belong to TC S but 

ll/A.(0-/o(OII-0 as n^oo 

and 

inf Ps„ {6t > d e } — ► e as T — >• oo. 

For such alternatives it is better to use the Chi-squared tests, which can 
be even asymptotically optimal in minimax sense. The construction of such 
tests for signals in white Gaussian noise can be found in Ermakov [3] and 
Ingster and Suslina [H]. For inhomogeneous Poisson processes see pi)]. It is 
interesting to study such tests in the case of ergodic diffusion processes too. 



S\ 



Sn 



> S 



a 



20 



References 



Chen S. X., Gao J. (2008) A test for model specification of diffusion 
processes. Ann. Statist. 36, 1, 167-198. 

Dachian, S. and Kutoyants, Yu.A. (2007) On the goodness-of-fit tests 
for some continuous time processes, in Statistical Models and Methods 
for Biomedical and Technical Systems, F.Vonta et al. (Eds), Birkhauser, 
Boston, 395-413. 

Durett, R. ( 1996) Stochastic Calculus: A Practical Introduction. Boca 
Raton: CRC Press. 

Durbin, (1973) Distribution Theory for Test Based on the Sample D.F. 
S1AM, Philadelphia. 

Ermakov, M.S. (1990) Minimax detection of a signal in a Gaussian white 
noise. Theory Probab. Appl., 35, 667-679. 

Fournie, E. (1992) Un test de type Kolmogorov-Smirnov pour proces- 
sus de diffusions ergodic. Rapport de Recherche, 1696, 1NR1A, Sophia- 
Ant ipolis. 

Fournie, E., Kutoyants, Yu. A. (1993) Estimateur de la distance mini- 
male pour des processus de diffusion ergodiques. Rapport de Recherche, 
1952, 1NRIA, Sophia-Antipolis. 

Gassem, A. (2008) Goodness-of-Fit test for switch- 
ing diffusion, prepublication 08-7, Universite du Maine 



(http: / /www. univ-lemans.fr/ sciences/ statist /download/ Gassem/article_anis.pdf). 



Gikhman, I.I. and Skorokhod, A.V.. (1969) Introduction to Theory of 
Random Processes. Philadelphia: W.B. Saunders. 

Ingster, Yu. I. and Kutoyants Yu. A., (2007) Nonparametric hypothesis 
testing for intensity of Poisson process, Mathem. Methods Statist, 16, 
217-245. 

Ingster, Yu.I. and Suslina, I. A. (2003) Nonparametric Goodness-of-Fit 
Testing Under Gaussian Models, Springer, N.Y. 

Koul, H. L. (1992). Weighted Empiricals and Linear Models. I. M.S. Lec- 
ture Notes, 21. Hayward, California. 



21 



[13] Kutoyants, Yu.A. (1997) Some problems of nonparametric estimation 
by the observations of ergodic diffusion processes. Statist. Probab. Lett., 
32, 311-320. 

[14] Kutoyants, Yu.A. (2004) Statistical Inference for Ergodic Diffusion Pro- 
cesses, Springer, London. 

[15] Kutoyants, Yu.A. (2008) Goodness-of-fit tests for perturbed dy- 
namical systems, prepublication 08-5, Universite du Maine 
( ]http: //www.univ- lemans.fr/sciences/statist/download/ /Kutoyants/G oF.pdf ). 

[16] Lehmann, E.L. and Romano, J. P. (2005) Testing Statistical Hypotheses. 
(3rd ed.) Springer, N.Y. 

[17] Liptser, R.S. and Shiryayev, A.N. (2001) Statistics of Random Processes. 
I, (2nd ed.) Springer, N.Y. 

[18] Negri, I. (1998) Stationary distribution function estimation for ergodic 
diffusion process. Stat. Inference Stoch. Process., 1, 1, 61-84. 

[19] Negri, I. and Nishiyama, Y. (2008) Goodness of fit test for ergodic 
diffusion processes. To appear in Annals of the Institute of Statistical 
Mathematics . 

[20] Revuz, D. and Yor, M. (1991) Continuous Martingales and Brownian 
Motion. Springer, N.Y. 



22 



